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Abstract 

Recently, it has been shown that CSMA algorithms which use queue length-based link weights can 
achieve throughput optimality in wireless networks. In particular, a key result by Rajagopalan, Shah, 
and Shin (2009) shows that, if the link weights are chosen to be of the form loglog(g) (where q is the 
queue-length), then throughput optimality is achieved. In this paper, we tighten their result by showing 
that throughput optimality is preserved even with weight functions of the form log(q)/g(q), where g(q) 
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can be a function that increases arbitrarily slowly. The significance of the result is due to the fact that 



weight functions of the form log(q)/g(q) seem to achieve the best delay performance in practice. 
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I. INTRODUCTION 

Efficient operation of wireless networks has always been a difficult task due to the inherent 
broadcast nature of the wireless medium. Transmission by a user can cause an interference 
for its neighbors. If two neighboring users transmit at the same time, the Signal-to-Noise-plus- 
Interference Ratio (SINR) of the users' links could go below the required SINR for the successful 
decoding of data packets at their corresponding receivers. In this case, we say that their messages 
collide with each other. Therefore, multiple users can transmit at the same time provided that 
they do not cause significant interference for each other. The users need a distributed Medium 
Access Control (MAC) protocol to determine which users should transmit which makes the 
optimal operation even harder. 
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CSMA (Carrier sense Multiple access) type protocols are an important class of MAC protocols 
due to their simplicity of implementation, and have been widely used in practice, e.g., in WLANs 
(IEEE 802.11 Wi-Fi) or emerging wireless mesh networks. In these protocols, each user listens 
to the channel and can transmit, with some probability, only when the channel is not busy. 
Despite the extreme simplicity of the CSMA-type algorithms, their efficiency have been always 
questionable. In this paper, we consider efficient design of such CSMA-type algorithms that can 
achieve maximum throughput and good delay performance. 

The wireless network can be modeled by its conflict graph (or interference model), where two 
communication links form two neighboring nodes in the conflict graph, if they cannot transmit 
simultaneously. The well-known result of Tassiulas and Ephremides [1] states that the Maximum 
Weight Scheduling (MWS) algorithm is throughput optimal, where weights are queue-lengths. 
However, for a general network, MWS involves finding the maximum weight independent set of 
the conflict graph in each time slot which is a formidable task, and hence, is not implementable. 
This has led to a rich amount of literature on design of approximate algorithms to alleviate the 
computational complexity of the MWS algorithm. A recent result in this direction of research 
can be found in ||9l. 

Recently, it has been shown that it is possible to design CSMA algorithms that are throughput- 
optimal. Reference ifTOl develops an algorithm that adaptively chooses the CSMA parameters un- 
der a time-scale separation assumption, i.e., the CSMA Markov chain converges to its stationary 
distribution instantaneously compared to the time-scale of adaptation of the CSMA parameters. 
This time-scale separation assumption was later verified by a stochastic approximation type 
argument |fTT|. [fT2|. In particular, an important recent work by Rajagopalan, Shah, and Shin 01 
builds an algorithm upon a Metropolis-Hastings sampling mechanism (Glauber dynamics over 
the set of feasible schedules) along with selection of link weights to be of the form loglog(g) (q 
is the queue-length). To establish the efficiency of the algorithm, they present a novel adiabatic- 
like theorem for the underlying queueing network: by choosing the weights to be of the form 
log log q, the underlying Markov chain behaves in an adiabatic manner such that it remains close 
to its equilibrium distribution. Similar algorithms with fixed link weights were developed earlier 
in US] and IfBl. 

Although a weight function of the form log log q stabilizes the network, the resulting scheduling 
algorithm reacts very slowly to changes in queue lengths which, in turn, results in a poor delay 
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performance. In this paper, we show that, by choosing weights to be of the form logq/g(q), 
a network adiabatic property still holds. As a result, we prove that the CSMA algorithms with 
such weight functions are throughput optimal. The function g can grow arbitrarily slowly such 
that log q/g(q) behaves very similarly to log(g). The significance of the result is due to the fact 
that such weight functions seem to also achieve the best delay performance in practice. 

The remainder of the paper is organized as follows. In section HH we briefly describe our 
model of wireless networks. The main results of the paper are presented in section Unl Section 
[IV] is devoted to the proofs. Some simulation results are presented in section [V] Finally, we will 
end the paper with some concluding remarks. 

II. Model of Wireless Network 

Consider a set of nodes where each node could be a source and/or a destination for another 
source. For now, we assume a single hop communication scheme but the results are naturally 
extendable to the multihop case. Therefore, there are iV communication links, each of which 
corresponds to a source-destination pair. 

Time is slotted and arrival process to each link is assumed to be discrete-time, where ai(t) 
is the number of packets arriving at link / in time slot t. For simplicity, assume that {a/(t)}^ , 
for I = 1,..,N, are independent Bernoulli processes with parameter A = [A/; / = 1,..,N]. In 
each time slot, one packet could be successfully transmitted over a link if the link Signal-to- 
Interference-plus-Noise-Ratio (SINR) is high enough. We use the notion of the conflict graph 
to capture the interference constraints or technological oneJ3. Let G(V, E) denote the conflict 
graph of the wireless network, where each node in the conflict graph is a communication link 
in the wireless network. There is an edge (I, k) G E between nodes / and k if simultaneous 
transmissions over communication links I and k are not successful. Therefore, at each time slot, 
the active links should form an independent set of G, i.e., no two scheduled nodes can share an 
edge in G. Formally, a schedule can be represented by a vector X = [x s : s = 1, N] such that 
x s G {0, 1} and Xi + Xj < 1 for all G E. let M. denote the set of all feasible schedules. 

Each link I is associated with a queue qi, where the queue dynamics are given by 

qi (t) = (q(t- 1) -xKt)) + + a ; (t) 



'For example, a node cannot transmit and receive at the same time. 
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for t > and I = 1, N. The vector of queue lengths is denoted by q(t) = [qi(t) : I = 1, N}. 

A scheduling algorithm is a policy to determine which schedule to be used in each time slot. 
The capacity region of the network is defined to be the set of all arrival rates A that can be 
supported by the network, i.e., for which there exists a scheduling algorithm that can stabilize 
the queues. It is known, e.g.[[TJ, that the capacity region is given by 



where Co(.) is the convex hull operator. When dealing with vectors, inequalities are interpreted 
component-wise. 

A scheduling algorithm is throughput-optimal if it can stabilize the network for any arrival rate 
in A. An important class of the throughput- optimal algorithms is the maximum-weight scheduling 
(MWS) algorithm where at each time slot t, the scheduling decision pit) satisfies 



where Wi(t) is the weight of link / at time slot t. In [1], it was proved that the MWS algorithm 
is throughput-optimal for wi(t) = qi(t). A natural generalization of the MWS algorithm in 
uses a weight /(<#(.)) instead of <#(.) with the following properties. 

1) / : [0, oo] — > [0, oo] is a nondecreasing continuous function with limg^oo f(qi) = oo. 

2) Given any Mi, M 2 > 0, and < e < 1, there must exist a Q < oo such that for qi > Q: 



Lemma 1. Suppose f is a strictly concave and monotonically increasing function, with /(0) = 0, 
then it satisfies the conditions (1) and (2) above. 

See the appendix for the proof. In this paper, we use a function / with properties of Lemma 



A = {A > : Bp e Co(M), \<p} 



N 




1=1 



(1 - e)f( qi ) < f( qi - Mi) < f( qi + M 2 ) < (1 + e)f( qi ) 



D 



III. Main Result 



A. Basic Algorithm 



For our algorithm, we choose the wight of link / to be 



Wi(t) = max (wi(t),w min (t)) 



(1) 
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where 

Wl (t) = f( qi (t)), (2) 

Wminif) = ^jyf(qmax(t)), (3) 

= log(l+x) 

and q max (t) is the length of the largest queue in the network at time t and assumed to be known. 
The function g(x) is a strictly increasing function chosen such that / is a strictly concave 
increasing function, for example g(x) = log(e + log(l + x)) or g(x) = (log(l + x)) 6 for some 
< 9 < 1. In this paper, all log's are in base e 

Consider the conflict graph G(V, E) of the network as defined earlier. Denote the neighbors 
of i by a set Af(i) = {k G V : (i,k) G i?}. At each time slot t, a node 2 is chosen uniformly at 
random, with probability i then 
(i) If ^(t - 1) = for all nodes j G then x^t) = 1 with probability and 

a:^) = with probability 1+exp 1 ( - (f)) . 
Otherwise, Xj(t)=0. 
(iii) acj- (t) = — 1) for all j ^ i. 

The following theorem states the main result regarding the throughput optimality of the algorithm. 

Theorem 1. Consider any e > 0. The algorithm can stabilize the network for any A G (1 — e)A, if 
the weight function is chosen to be in the form of fix) = „}t x ^ . The function g(x) is a strictly 
increasing function chosen such that f is a strictly concave increasing function. In particular, 
the algorithm with the following weight functions is throughput-optimal: f(x) = j^^^^f^j 
or f(x) = (log(l + x)Y~ e for some < 6 < 1. 

B. Distributed Implementation 

The basic algorithm is based on Glauber-Dynamics with one site update at each time. For 
distributed implementation, we need a randomized mechanism to select a link uniformly at each 
time slot. We use the Q-CSMA idea Q to perform the link selection as follows. Each time 
slot is divided into a control slot and a data slot. The control slot, which is much smaller than 
the data slot, is used to generate a transmission schedule for the data slot. First, the network 
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selects a set of links m{t) that do not conflict with each other. Then, it performs the Glauber- 
Dynamics updates, in parallel, over links m{t) to produce a transmission schedule X(t) for data 
transmission. m{t) is called the decision schedule at time t. For example, a simple randomized 
mechanism to generate m(t) is as follows. In control slot t, each link I sends an INTENT 
message with probability 1/2. If / does not hear any INTENT messages from its neighboring 
links J\f(l), it will be included in m(t), otherwise it will not be included in m(t). Therefore, by 
the end of the control slot, any feasible decision schedule m(t) C M. could be selected with a 
positive probability a(m(t)). Once a link knows whether it is included in the decision schedule, 
it can determine its state in the data slot based on its carrier sensing information (i.e., whether 
its conflicting links were active in the previous data slot) and the activation probability for the 
current slot (based on its queue length). 

To determine the weight at each link I, q ma x{t) is needed. Instead, each link / can maintain 
an estimate of q ma x(t)- We can use the procedure suggested in H| to estimate q max {t), and use 
Lemma 2 of 01 to complete the stability proof. So we do not pursue this issue here. In practical 
networks log(l + q max ) is small and we can use the weight function / directly, and thus, 
there may not be any need to know q max (t). 

Corollary 1. Under the weight function f specified in TheoremUl the distributed algorithm can 
stabilize the network for any A G (1 — e)A. 

IV. Proof of the Main result 

Before we start the proof, some preliminaries, regarding stationary distribution and mixing 
time of Glauber dynamics, are needed. 

A. Preliminaries 

Consider a time-homogenous discrete-time Markov chain over the finite state-space M.. For 
simplicity, we index the elements of Ai by 1, 2, r, where r = \JA\. Assume the Markov chain 
is irreducible and aperiodic, so that a unique stationary distribution n = [ir(i), ...,%(r)} always 
exists. 

1 ) Distance between probability distributions: First, we introduce two convenient norms on 
W that are linked to the stationary distribution. Let £ 2 (n) be the real vector space W endowed 
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with the scalar product 

r 

(z,y)n = ^2z(i)y(i)ir(i). 

i=l 

Then, the norm of z with respect to rr is defined as 

/r \V2 

IMk = ( X^WMO I • 

We shall also use ^ 2 (^), the real vector space R r endowed with the scalar product 

r 1 

(z,y)i = ^z(i)y{i) — 

i=i v ' 

and its corresponding norm. For any two strictly positive probability vectors fx and n, the 
following relationship holds 

H 1 1I . || H 

\\p — 7T||l = || l\\ n > 2\\fl — 7t\\tv, 

* 71 

where \\n — p\\rv is the total variation distance 
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\n-fi\\TV = gLR'J-WJ 

2) Glauber dynamics: Consider a graph G(V,E). Glauber dynamics is a Markov chain to 
generate the independent sets of G. So, the state space M. consists of all independent sets of G. 
Let |V| = N. Given a weight vector W = [w\,W2, ...,wn], at each time t, a node % is chosen 
uniformly at random, with probability ^, then 

(i) If Xj(t — 1) = for all nodes j e then = 1 with probability i^^-.y or 

Xi (t) = with probability 1+(a ^ (g<) • 
Otherwise, Xj(t)=0. 
(iii) = — 1) for all j 7^ i. 

The corresponding Markov chain is irreducible, aperiodic, and reversible over M, and its 
stationary distribution is given by 

tt(p) = ^exp(^Wj); pe.M, (5) 

where Z is the normalizing constant. 

The basic algorithm uses a time-varying version of the above Glauber dynamics, where the 
weights change with time. This yields a time-inhomogeneous Markov chain but we will see that, 
for the proper choice of weights, it behaves similarly to the Glauber dynamics. 



3) Mixing time of Glauber dynamics: The convergence to steady state distribution is geometric 
with a rate equal to the second largest eigenvalue modulus (SLEM) of the transition matrix as 
it is described next (6]|. 

Lemma 2. Let P be an irreducible, aperiodic, and reversible transition matrix on the finite state 
space Ai with the stationary distribution tt. Then, the eigenvalues of P are ordered in such a 
way that 

Ai = 1 > A 2 > ... > A r > -1, 
and for any initial probability distribution p on Ai, and for all n > 1 

||//oP" - tt|| i < cr n \\ij, — 7r|| 1 , (6) 

7T TV 

where o = max{A 2 , |A r |} is the SLEM of P. 

The following Lemma gives an upper bound on the SLEM o(P) of Glauber dynamics. 

Lemma 3. For the Glauber Dynamics with the weight vector W on a graph G(V, E) with 
\V\=N, 



1Q N exp(ANw max y 
where w max = max ieV Wi- 

We define the mixing time as T = so 

T< 16 Jv exp(4iV^ max ) (7) 

Simple calculation, based on Lemma [2l reveals that the amount of time needed to get close to 
the stationary distribution is proportional to T. 

B. A key lemma 

At any time slot t, given the weight vector W(t) = [wi(t), ■u)jv(i)], the MWS algorithm 



should solve 



max ) wAt), 



instead, our algorithm tries to simulate a distribution 



(P) = 7 exp(^ Wi{t)); peAt, (8) 



i&p 
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i.e., the stationary distribution of Glauber dynamics with the weight vector W(t) at time t. 

Let P t denote the transition probability matrix of Glauber dynamics with the weight vector 
W(t). Also let fi t be the true probability distribution of the inhomogeneous-time chain, over the 
set of schedules Ai, at time t. Therefore, we have [i t = fa-iPt- Let n t denote the stationary 
distribution of the time-homogenous Markov chain with P = P t as in ®. By choosing proper 
w m i n and /(.), we aim to ensure that fi t and n t are close enough, i.e., 

IK - A**||tv < S/A 

for some 5 arbitrary small. 

Let w max (t) = f(q m ax(t)). The following lemma gives a sufficient condition under which the 
probability distribution of the inhomogeneous Markov chain is close to the stationary distribution 
of the homogenous chain. 

Lemma 4. Given any 5 > 0, |K — AH||tv < f holds for all t > t*, if 

UtTt+i < 5/16 for all t > 0, (9) 

where 

(i) a t = 2Nf\f-\w mm (t + l))-l), 

(ii) t* is the smallest t such that 

* 1 

T2 > ln ( 4 /^) + N (^maM + l0g2)/2, (10) 
fc=l k 

and Tt+i is the mixing time of the Glauber dynamics with the weight vector W(t + 1). 

Lemma H] states a condition under which |K ~ AH 1 1 TV < f f° r a U t >t*. Instead, assume that 
© holds only when ||g(t)|| > qut. for a constant q th > 0. Let t\ be the first time that ||g(t)|| hits 
q t h- Then, after that, it takes t* time slots for the chain to get close to rr t if ||g(i)|| remains above 
q t h for ti < t < ti + 1*. Alternatively, we can say that |K — /h||tv < 2 if \\q(t)\\ > q t h + 1* 
since at each time slot at most one departure can happen and this guarantees that ||g(t)|| > qth 
for, at least, the past t* time slots. This immediately implies the following Lemma that we will 
use in the proof of the main result. 



2 In this paper, ||g(£)|| = ||<?(i)||oo = max;g;(t) = q m aw(i). 
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where 



(i) a t = 2Nf'(f- 1 (w,, 



mm 



(*+!)) 



1) 



(ii) T t < 1Q N exp(4Nw max {t)) 



(ii) t* is the smallest t such that 



* >ln(4/5) + iV(/(^) + log2)/2. 



E 



(12) 



k=tr.\\q{tl)\\=qth 



k 



In the above Lemma, condition (ii) is based on the upper bound of © and the fact that 

In other words, Lemma \5\ states that when queue lengths are large, the observed distribution 
of the schedules is close to the desired stationary distribution. 

Remark 1. We will later see that, to satisfy condition ([77]) and to find a finite t* satisfying 
([72]) in Lemma\5\ the function /(.) cannot be faster than log(.). In fact, the function f must be 
slightly slower than log(.) to make the weight dynamics slow enough such that the distribution 
of the schedules remains close to the stationary distribution. 

Remark 2. The above Lemma is a generalization of Lemma 12 (Network Adiabatic Theorem) 
of !3\l- Here we consider general functions /(.), whereas ^ considers a particular function 
loglog(.). The generalization allows us to use functions which are close to log(.) and perform 
much better than loglog(.) in simulations. The proof of Lemma \4\ is presented in the appendix. 

C. Throughput optimality 

We will use the following Lemma [2] to prove the throughput-optimality of the algorithm. 

Lemma 6. For a scheduling algorithm, if given any < e < 1 and < 5<1, there exists a 
B(6, e) > such that: in any time slot t, with probability larger than 1 — 5, the scheduling 
algorithm chooses a schedule X(t) G M. that satisfies 
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whenever ||q(£)|| > B(5,e), then the scheduling algorithm is throughput-optimal. 

Remark 3. Throughput optimality in Lemma \6\ means that, for all the rates inside the capacity 
region, system will be stable in the mean (See ^ for more details), i.e., 



1 

lim sup — E 



t=0 



N 



2 



£/%(*)) 



i=l 



< OO. 



(13) 



In our setting, the queuing system is an irreducible and aperiodic Markov chain, and therefore 
stability -in-the mean property rfTil) implies that the Markov chain is also positive recurrent /[7|/ . 

Let w*(t) = m&x p€ M ^2 iep Wi(t). Let us define the following set: 

Xt = {peM: ^2wt(t) < (1 - e)w*{t)} 

Therefore, we need to show that 

Pt(xt) = v*{p) < s 

P^Xt 

for large enough. Suppose /(.) and w min are chosen such that a t T t+ i < 5/16 whenever 

||g(t)|| > q t h for some constant q th > to be determined later. Then, it follows from Lemma [5] 
that whenever \\q(t)\\ > q th + £*, 

2\\tH ~ ntWrv < 8/2, 



and consequently, 



Thus, 



J2Mp)~n(p)\< 6/2. 
peM 



< 5/2 



which yields 

J2pt(p) < ^7r f (p) + 5/2. 

pext pext 

Therefore, to ensure that ^]„ gY Pt(p) < 8, it suffices to have 



J2Mp)<5/2. 



P^Xt 
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But 



where 



So 



Wi(t) = max{wi(t) , w min (t)} < Wi(t) +w min (t). 



P^Xt 



P^Xi 



i£p 



~z exp E exp(\p\w min (t)) 



P^Xt 



i&p 



< V — exp((l - e)w*(t)) exp(iV«; min (0) 



and 



Therefore, 



Z t = ^ exp(^Wi(t)) > exp(J^tUi(*)) > e 



^TTt(p) < 2 Af exp(iVuw(t) -ew*(t)) 

and u>*(i) > w max (t). So, it suffices to have 

2 exp(iVw miri (t) €W max 

(*)) < */2 

when ||g(t)|| > + £*. The choice of w min (t) = ^w max {t), satisfies the above condition for 
> B, where 



B = max < q th + t* J~ 



x ^iVlog2 + log- 
^72 



(14) 



D. A class of weight functions with the maximum throughput property 

In this section, we describe a family of weight functions / that yield a maximum throughput 
algorithm. 

The function / needs to satisfy Lemma [5l Roughly speaking, since the mixing time T 
is exponential in w max , f'(f~ l (w m i n )) must be in the form of e~ Wmin ; otherwise it will be 
impossible to satisfy a t T t+ i < 5/16 for any arbitrarily small S as — >* °°- The only 
function with such a property is the log(.) function. In fact, it turns out that / must grow 
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slightly slower than log(.) as we show next to satisfy (fTTT) . and to ensure the existence of a finite 
t* in Lemma [51 

Consider weight functions of the form fix) = log ^ +x ^ where g(x) is a strictly increasing 
function, chosen such that / satisfies the conditions of Lemma CD For example, by choosing 
functions that grow much slower than log(l + x), like g(x) = log(e + log(l + x)), we can make 
f(x) behave approximately like log(l + x) for large ranges of x. 

Assume g(0) > 1, then 

m < j^. d5) 

The inverse of / cannot be expressed explicitly, however, it can be written as 

f- 1 (x) = exp(xg(f-\x)))-l. (16) 



Therefore, 



f'(f-\w mm )-l) < — (17) 

J \^min ) 



1 



(18) 



exp(w min g(f- 1 (w min ))) - 1' 
Using (TT7T ), the conditions of Lemma [5] are satisfied if there exists a q t h large enough such that 

2A16 iV exp(4Aw ma:!; ) 7 77^77 777 - < S/16 (19) 

for \\q(t)\\>q th . 

Using (fT6l) and noting that w min = j^w max , ( IT9b can be written as 



2A16 7V exp I w r , 



g{f {Wmin)) 



, 1 + — ^ r ) < 5/16 (20) 

e \J \ J >min) / 

Consider fixed, but arbitrary, N and e. As q max — > 00, w max — > 00, and consequently w m i n — > 00 

and f~ 1 {w min ) — > 00. Therefore, the exponent — g{f~ 1 {w m i n )) is negative for q max large 

enough, and thus, there is a threshold q t h such that for all q max > q t h, the condition (|20l) is 

satisfied. To be more accurate, it suffices to choose 

X (2N / / 64A16 jV , _ t 16A 2 \\ 
ftfc = / ( — x max<jlog( )J(g ( — - — )) > J ■ (21) 

Then, it follows from Lemma |5] that \\n t — fitWrv < l» whenever > + 1*. 
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Remark 4. The assumption g(0) > 1 is not required, since, as we saw in the above analysis, 
only the asymptotic behavior of g is important. If we choose q t u large enough such that 

g(f-\w mm (t)) - 1) > 1 (22) 

when \\q(t)\\ > qth, then ( f77| ) holds and the rest of the analysis follows exactly. In particular, in 
order to get an explicit formula for f~ x , we can choose g{x) = log(l + x) e for some < 6 < 1. 
The weight function for such a g is f(x) = (log(l + x)) 1 ^ 9 , and f^ 1 and has the closed form 

= exp(x^ =s ) - 1. 

Then ( 1271 ) yields 

( \2N ^ .64^16^, 2N AQN\ il^A 

Qth = ex p i max i — i°g( — 2 — ''~r(~~r~'\ J' 

It is easy to check that for q(t) > exp ^(^r) 1 ^ log(l + e)\ w m i n {t) > /(e) which satisfies 
$221) . Therefore, obviously, d22l) a/50 /zo/ds /or of ( |2?1) . 

The last step of the proof is to determine the constant B in ([141) . so we need to find t*. Let 

ti be the first time that q ma x{t) hits q th , then 

ti+t j ti+t 

-L I. 

ti+t 



e ~8Nf(q max (k)) 



k=t x k k=ti 



fe=ti 

ti+i 

= lO > e g(qmax(k)) 

tl+t 

= IQ- 2N J^i 1 + QrnaAk)) 
k=ti 
t 

> ^ + q th + k y 



8N 



g(qmax CO) 

fe=ii 

8JV 



S(lth) 

k=l 

27V. 



> 16- 21 "t(l + q th + t) ^7 
Therefore, by Lemma [51 it suffices to find the smallest t that satisfies 

SN M 

16- 2iV t(l + q th + t) > log(4/5) + - log(2(l + q th )) 



for a threshold q t h large enough satisfying (I2TI) . Recall that g{.) is an increasing function, 

8N 

g(Qth) 



therefore, by choosing q t h large enough, f N , can be made arbitrary small. Then a finite t* 
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always exists since 



lim t*(l + q th + t*) 9 ^th) = co. 

t*->oo 



In particular, for the function f(q) = (log(l + q)) 1 6 , < 9 < 1, and the choice of q th in (1231) , 
we have 



IN 



IN 



< 



Note that 



g(q th ) log(l + q th ) 2iV 

t ^ f l-e/2N 



(t + i + q th y/™ - (2+ gth y/ 2N 

and therefore, it is sufficient to choose t* to be 



t* 



(2 + q th )^lQ N \og[^(2(l + q th )) N / 2 



This concludes the proof of the main Theorem. 



(24) 



E. Extension of the proof to the distributed implementation 

The distributed algorithm is based on multiple site-update (or parallel operating) Glauber 
dynamics as defined next. Consider the graph G(V, E) as before and a weight vector W = 
[wi, w 2 , Wn). At each time t, a decision schedule m(t) C Ai is selected at random with 
positive probability a(m(t)). Then, for all i E m(t), 

(i) If Xj(t — 1) = for all nodes j G jV(«), then Xi(t) = 1 with probability 1 ^p(^-) ' or 
^(t) = with probability 1+c ^ (t5<) . 

Otherwise, Xi(t)-0. 

(ii) = — 1) for all j £ m(t). 

The Markov chain X(t) is aperiodic and irreducible if U me _\4 = V (See [31 for more detail). 
Also, it can be shown that X(t) is reversible, and it has the same stationary distribution as 
regular Glauber dynamics in ©. Here, we will assume that a min := min m a(m) > [1/2) N . 
Then, the mixing time of the chain is charachterized by the followng Lemma. 

Lemma 7. For the multiple site-update Glauber Dynamics with the weight vector W on a graph 
G(V,E) with \ V\ = N, 

T t <—exp{4Nti max ). (25) 

where w max = max ie y Wi. 
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18 19 20 21 



Fig. 1. A grid network with 24 links. 



See the appendix for the proof. The distributed algorithm uses a time-varying version of 
the multiple-site update Glauber dynamics, where the weights change with time. Although the 
upperbound of Lemma [7] is loose, it is sufficient to prove the optimality of the algorithm. The 
analysis is the same as the argument for the basic algorithm. Let D and W denote the lengths 
of the data slot and the control slot. Thus, the distributed algorithm can achieve a fraction D ^ w 
of the capacity region. In particular, recall the simple randomized machanism, in section IIII-BL 
where each node joins the decison schedule by sending an INTENT message with probability 
1/2. Note that in this case a min > (1/2) N , and also it sufficies to allocate a short mini-slot at 
the begining of the slot for the purpose of control. By choosing the data slot to be much larger 
than the control slot, the algorithm can approach the full capacity. 

V. Simulation Results 

In this section, we evaluate the performance of diffrent weight functions via simulations. For 
this purpose, we have considered the grid network of Figured! which has 16 nodes and 24 links, 
under one hop interference constraint. Consider the following maximal schedules 

Mi = {1,3,8,10,15,17,22,24} 
M 2 = {4,5,6,7, 18,19,20,21} 
M 3 = {1,3, 9, 11, 14, 16, 22, 24} 
M 4 = {2,4,7,12,13,18,21,23} 
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With a little abuse of abuse of notation, let Mj also be a vector that its i-th element is 1 if 
i G Mj and otherwise. We consider arrival rates that are a convex combination of the above 
maximal schedules scaled by < p < 1, e.g., 



\ = p^2c i M i , c = [0.2,0.3,0.2,0.3]. 



i=l 

Note that, as p — >■ 1, A approaches a point on the boundary of the capacity region. We simulate 
the distributed algorithm, and use the following randomized mechanism, as in 0, similar to 
IEEE 802.11 DCF standard, to generate the decision schedules in the control slots. At time slot 

t: 

1) Link i selects a random back-off time Tj uniformly in [0, W — 1] and waits for Tj control 
mini-slots. 

2) IF link i hears an INTENT message from a link in M(i) before the (Tj + l)-th control mini- 
slot, i will not be included in m{t) and will not transmit an INTENT message anymore. 

3) IF link i does not hear an INTENT message from any link in M{i) before the (Tj + l)-th 
control mini- slot, it will broadcast an INTENT message at the beginning of the (Tj + l)-th 
control mini-slot. Then, if there is no collisions (i.e., no other link in M{%) transmits an 
INTENT message in the same mini-slot), link i will be included in m(t). 

Once m(t) is found, the access probabilities are determined as described in the distributed 
algorithm in section IIII-BI Here, we choose W = 32 (which is compatible with the back-off 
window size specified in IEEE 802.11 DCF). 

In our simulations, the performance of log(l + x) and i g(e+iog(i+a;)) * s ver y c l° se to eacn 
other, so in the plots, for brevity, we use the name log while the results actually belong to the 

log(l+x) 



function . , , . . -. 

Iog(e+log(l+x)) 

Figure [2] shows the average queue-length evolution (total queue-length divided by the number 
of links), for the weight functions f(x) = lo J°+[^l\. x)) and f(x) = loglog(e + x) and for 
loadings p = 0.8 and 0.82. While both functions keep the queues stable, however as it is 
expected, the average-queue lengths for the weight function are much smaller than those 

for log log. Moreover, x ^°f yields a faster convergence to the steady state. The performance 
gap of two functions, in terms of the average queue-length and the convergence speed, increases 
significantly for larger loadings; for example see Figure [3] for p = 0.85. Figures @] and [5] show 
the delay performance (time-average queue-length per link) of the two weight functions under 
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Fig. 2. The evolution of average queue-length for log log and lo ^f og (called log in the plots). 



4500 




Fig. 3. The evolution of average queue-lengths for p — 0.85. 



different loadings. As it is evident from the figures, log has a significantly smaller delay than 
what is incurred by using the weight log log. A natural question is that whether there exists a 
function faster than log-wise functions that still stabilizes any general network. If such a function 
exists, then one will expect to get a better delay performance. Our conjecture is that, since the 
mixing time is, in general, exponential in w max , log is the fastest weight function that can make 
the network change in an adiabatic manner, and hence keep the system close to its equilibrium 



19 




P 



Fig. 4. Time-average of queue-length per link for low and moderate values of p. 




(stationary distribution). We tried faster weight functions, such as q and Jq, but they resulted 
in unstable systems (for example see Figure [6]). 

VI. CONCLUSIONS 

In this paper, we considered the design of efficient CSMA algorithms that are throughput 
optimal and have a good delay performance. The algorithm is essentially a Glauber Dynamics 
with, potentially, multiple-site updates at each time-slot. Access probabilities depend on links 
weights, where the weight of each link is chosen to be an appropriate function of its queue- 
length. In particular, we showed that weight functions of the form f(q) = log(q)/g(q) yield 
throughput-optimality and low delay performance. The function g(q) can grow arbitrarily slowly 
such that f(q) w log(g). 
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4 6 
Time steps 



Fig. 6. The weight function ^/q makes the system unstable (p = 0.92). 
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Appendix A 

Proof of LemmaUl' Let / be strictly increasing with /(0) = 0, therefore lim^oo f(q) = oo. 
Consider an arbitrary e > and M 2 > 0. We need to to find a Q > such that for all q > Q, 

f(q + M 2 )<(l + e)f(q) 



or equivalently, 



Since / is concave^, 



f(q + M 2 )-f(q) 



< e. 



/(?) 

f(q + M 2 )-f(q) ^ f(q) 



< 



Mo. 



/(?) " m 

It also follows from concavity that /' is nonincreasing, and f'(q) > for all q > since / is 
strictly increasing. Therefore f'(q) must have a limit as q — > oo while f(q) grows to infinity. 
Therefore, by choosing q large enough, j^j- can be made smaller than e/M 2 ■ This concludes 
the proof of the upper bound. The proof of the lower bound follows similarly. ■ 



3 We have implicitly considered the class of continuously differentiable functions. 
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Appendix B 

Proof of Lemma \3\- The upper-bound in Lemma [3] is based on the conductance bound flU, 
0. First, for a nonempty set B C E, define the following s: 

tt(£) = 5>(z), 

ieB 

F{B)= Yl <*)P* 
ieB,jeB c 

Then, conductance is defined as 

4>{P) = inf 

B-.TT t (B)<l/2 Tr(B) 

Lemma 8. (Conductance Bounds) 

<b 2 (P) 

1 - 20(P) < A 2 < 1 - 
The conductance can be further lower bounded as follows. 

B:n(B)<l/2 7r(B) 
~ X^B,Y&B c 

> 2minvr(X) minP(X,y) 

For our Glauber Dynamics, the stationary distribution is lower bounded by 

1 1 

7r(p) > — > . 

Y. P ex P(J2iep w i) exp(iVw 

max ) 

In addition, X and Y can differ in at only one site, and it is easy to see that 

1 1 



P(X,Y) > 



So 



<KP) > 



N 1 + exp(uv, 
1 



> 



A^-^l + exp(w max )) exp(Nw 

1 



Therefore, 



jV2*exp((jV + l)«w)" 

HP) < 1 



< i 



2A^exp(2(iV + l) W w) 
1 



16* exp(4Nw r , 



By Gershgorin's theorem (e.g. see the appendix of H), for a stochastic matrix [Py], 



A > — 1 + 2minPii. 

For our Glauber-Dynamics, 

P > i 1 V 1 1 V exp(wj) 

N 



AT ^ 1 



^ 1 + exp(w Tl 
1 



So, 



Therefore, 



A r > -1 



1 + exp(w max ) 

2 1 — exp(uVi 



1 + exp (w max ) 1 + exp (w 

r, 



max{A 2 , |A r |} = A 2 
and the SLEM of P is upperbounded by 

1 

o < 1 



16* exp(4Nw r , 
Consequently 

T< 16 N exp(ANw ma: 



Appendix C 

Proof of Lemma 0' 

The corresponding stationary distributions at times t and t+1 are respectively given 

Mp) = 4exp(^Wi(t)), 



Z 



and 

IT i i if'/.)) : 



^+i(p) = 4exp(^Wi(t + l)), 



So 



^±i^ = exp(V^(t + 1) - Wi(t)) 
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or 

exp 



\&i(t + 1) - €>i(t)\ \ < ^±1^1 < exp (j2 + !) - ) • 



Let g t * denote / 1 (w min (t)), and g(t) = max{g f *, q(t)}, where g(t) is the vector of queue 
lengths at time t. Recall that / is a concave and increasing function. Hence, if q^t + 1) > qi(t), 

wi(t + 1) - = fm + 1)) - fm)) < fm))m + 1) - *(*)) < /'»))■ 

(note that ^(t + 1) and ^(t) at most differ by one since there can at most one packet arrival or 
departure in a time slot). Similarly if q^t + 1) < - /(?»(* + !)) < + !))• 

So 

i^(t + 1) - < + f (q^t + 1)) < + 1) - 1) + fm + 1)) 

and therefore 

\w i (t + l)-w i (t)\<2f'(q*(t + l)-l). 

Define 

a t = 2iV/V(* + l)-l), (28) 

then 

e-* < < e ai . (29) 

The drift in 7r t is given by 

iu ^ ii 2 - ii !*j 1 1 1 2 

iFt+i -TTtHi/ - ||- 1|| 

»t+l 

p ^+i(p) 

< ^7T t+1 (p)m a x{(e--l) 2 ,(l-e--) 2 } 

p 

< max{(e ai - l) 2 , (1 -e~ Qt ) 2 } 
= (e a '-l) 2 

- TTtlll/irt+i < 2«( (30) 

a, = 2Nf(f- 1 (w min (t + 1)) - 1). (31) 



for a t < 1. Thus, 



for a t < 1, where 
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The distance between the true distribution and the stationary distribution at time t can be bounded 
as follows. First, by triangle inequality, 

WlM - ^tlll/TTt < \\Ht - TTt-llll/TTt + IKt-l - ^t\\l/n t 
< \\Ht - 7rt-l||l/7r t + 2d! t _i. 

On the other hand, 

Win -n-i\\l/ nt = 52^fiMp)-*t-i(p)) 2 

p MP) n t -i{p) 

Therefore, for a t < 1, 

II — -M\n t < (l + at-i)||//t-7r t _i||i/ 7rt _ 1 +2a t _ 1 . 
Suppose a t < 5/16, then ||^ - l|| Wt < 5/2 holds for t > t*, if 



2 



||/it - 7Tt-l||l/7r t _i < 5/4 

for all i > t*. 

Define a t = \\/j, t+1 - n t \\ 1/7Tt . Then 

a t+l = \\Pt+2 — 7Tt+l || 1/nt+i 

— Wfa+lPt+l — 7I"t+l||l/7r t+ i 
< °"mll^+l — 7Tt+l||l/7r t+ i 

where a t+ i is the SLEM of P t +i, since (P t+1 ,7r t+ i) is reversible. Therefore, 

Ot+i < cr t+ i[(l + a t )a t + 2cc t ] 
Suppose a t < 5/4. Defining T t = j^-, we have 

a t+1 <(l--^-)[V4 + (2 + V4)« t ]. 

Thus, a t+ i < 5/ A, if 

(2 + <f/4)a t < ^-(5/4 + (2 + 5/4)a t ), 
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or equivalently if 

5/4 



. Tt+i 

a t < 



(2 + 5/4)(l-l/T t+i y 
But i/1 JUL x , 

Tt+i Tt+i > 



(2 + <f/4)(l - 1/T m ) 4(1 - l/T t+1 ) 16 T t+1 
so, it is sufficient to have 

OitTt+i < 5/16. 

Therefore, if there exists a time t* such that a t * < 5/4, then a t < 5/4 for all £ > i*. To find t*, 
note that a t > 5/ A for all t <t*. So, for i < f, we have 

^)[(1 + at-i)at-i + 2a t -i] 
J* 

^)[(l + a t _ 1 )a t _ 1 + 2a t _ 1 4^i] 

— )(1 + a t -i + -a t _ija t _i 

1 W1 (J/16,, 8,, 
-)(l + ^(l + -))a t _ 1 

^)(l + ^)at_i 



Thus, 



where 





(1 


< 


(1 


< 


(1 


< 


(1 


< 


(1 




(1 


< 


e 



T 7 

* <h-i 

Z^fc=l ~^7Z 

a t < a e ^ , 

II Til 

a o — II J-IUo 

= H/^O-Po — 7To||l/iro 
< (7(Po)||//o-7ro||l /7ro 
1 

< 
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and 

7r™ m = min7r (p) 
p 

1 

Ep ex P(E iGp ^(0)) 
1 

\M\exp(Nw max (0)) 



which yields 

a < (2e Wma * (0) ) N/2 - 



Putting everything together, t* must satisfy 

( 2e — (0)^/2^=1% < J/4 



or as a sufficient condition 

t* 



^2 > MV*) + N(w max (0) + log2)/2. 



T 2 
fc=i k 



Appendix D 

Proof of Lemma 0" 

Let jMo C X be the set of all possible decision schedules. Given X(t) = X, for some 
X E M, the next state/schedule could be X(t+ 1) = Y with the following transition probability 

P (x,Y)= yi «h n iriM n tt&' <32) 

where XAF = U (Y\X). 

The upper-bound in Lemma [7] is based on the conductance bound as in the proof of Lemma 
[3l Recall that the conductance can be lower bounded as follows. 

(j)(P) > 2 min x n(X) mm x ^ Y P(X, Y) 
As in the regular Glauber Dynamics, 

n ^ X) ~ 2 N exp(Nw max )' 
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and, 

/ 1 \ N 

P(X, Y) > a mil 
where a min = min mgMo a (to) > ^r. Hence, 



1 + exp(w max ) 



4>{P) > 2 



> 



A^(l + exp(w max )) N exp(Nw 

max J 

2 



8 N exp(2Nw max )' 
Therefore, based on the conductance upperbound, 

2 



HP) < i 



64^exp(4iV Wma:c ) 
and by Gershgorin's theorem, 

9 

A r > -1 + 



2 N (l+exp(w max )) N ' 
Therefore, 

max{A 2 , |A r |} = A 2 
and the SLEM of P is upperbounded by 

2 

o t < 1 



64 N exp(4Nw max y 

Consequently 



64^ 

T< —exp(4Nw max ). (33) 
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